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CHIMAERIC GENE CODING FOR A 
TRANSIT PEPTIDE AND A 
HETEROLOGOUS POLYPEPTIDE 


This application is a continuation of application Ser. No. 
08/267 .306 file Jan. 29, 1994, now abandoned, which is a 
continuation of application Ser. No. 08/026.213 filed Mar. 1. 
1993. now abandoned, which is a continuation of application 
Ser, No. 07/794.635 filed Nov. 18, 1991, now abandoned, 
which is a continuation of application Ser. No. 07/480,343 
filed Feb. 14, 1960, now abandoned, which is a continuation 
of application Ser. No. 06/755.163 filed Jul. 15, 1985, now 
abandoned. 


FIELD OF THE INVENTION 


The invention relates to means, particular recombinant 
vectors, and to processes for the controlled introduction of 
foreign gene products into plant chloroplasts. 

The disclosure which follows contains reference numbers 
in the form of exponents. They refer to bibliographic refer- 
ences relative to literature referred to at the end of this 
specification. Other literature is also referred to in the course 
of this description by the name of the first author and date 
of publications. All the articles as well as the patent 
applications, patents, ctc., which shall be referred to 
throughout this specification are incorporated herein by 
reference. 

Tt is well known that the cells of eukaryotic organisms, 
and more particularly plant cells, contain distinct subcellular 
compartments, or organelles, delimited by characteristic 
metubrane systems and performing specialized functions 
within the cell In photosynthetic leaf cells of higher plants 
the most conspicuous organelles are the chloroplasts, which 
exist in a semi-autonomous fashion within the cell, contain- 


ing their own genetic system and protein synthesis ~ 


machinery, but relying upon a close cooperation with the 
pucleo-cyteplasmic system in their development and bio- 
synthetic activities’. 

The most essential function of chloroplasts is the perfor- 
mance of the light-driven reactions of phctosynthesis. But 
chloroplasts also carry out many other biosynthetic pro- 
cesses of importance to the plant cell. For example, all of the 
cell's fatty acids are made by enzymes located in the 
chloroplast stroma, using the ATP. NADPH, and carbohy- 
Grates readily availabie there. Moreover, the reducing power 
of light activated electronic drives the reduction of nitrite 
(NO~,) to ammonia (NH,) in the chloroplast; this ammonia 
provides the plant with nitrogen required for the synthesis of 
amino acids and nucleotides. 

the chicroplast also takes part in processes of particular 
concera to the agrochemical industry. 

Particularly it is known that many herbicides act by 
blocking functions which are performed within the chloro- 
plast. Recent studies have identified the specific target of 
several herbicides. For instance, triazine derived herbicides 
inhibit photosynthesis by displacing a plastoquinone mol- 
ecule from its binding site in the 32 Kd polypeptide of the 
photosystem IL This 32 Kd polypeptide is encoded in the 
chloroplast genome and synthesized by the organelle 
machinery. Mutant plants have been obtained which are 
resistant to triazine herbicides. These plants contain a mutant 
32 Kd protein from which the plastoquinone can no longer 
be displaced by triazine herbicides. 

Several other herbicides are known to block specific steps 
in amino acid synthesis. Sulfonyl-ureas are known to inhibit 
acetolactate synthase. This enzyme is involved in isoleucine 
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and valine synthesis. Glyphosate inhibits the function of 
$-enol pyruvy]-3-phosphoshikimate synthase, which is an 
enzyme involved in the synthesis of aromatic amino acids. 
All these enyzmes are encoded by the nuclear genome, but 
they are translocated into the chloroplast where the actual 
amino acid synthesis takes place. 

Enzymes responsible for the same functions are also 
present in prokaryotes. It should be easy to obtain bacterial 
mutants in which the enzyme of interest is no longer 
sensitive to the herbicide. Such a strategy was used with 
success to isolate Salmonella typhimurium mutants with an 
altered aro A gene product, which confers resistance to 
glyphosate (Comai et al Science 221, 370 (1983). 

Thus the use of chloroplastic or bacterial genes to confer 
herbicide resistance to plant cells could be successful if their 
gene products were efficiently transported into the chioro- 
plast where they function. 

Chloroplasts are also involved in the complez mecha- 
nisms which regulate the levels of amino acid synthesis. One 
of the most important regulatory mechanisms is the 
so-called retroregulation. This mechanism involves the inhi- 
bition of the key enzyme of a given pathway by the end 
product(s) of this pathway. When a key enzyme is no longer 
subjected to such regulation the organism overproduces ihe 
cosresponding end product (e.g.. an amino acid). 

Isolation of mutant genes encoding for enzymes that are 
insensitive to inhibition by the corresponding end product is 
well documented in bacteria. Similar mutuants in plant cells 
are difficult to obtain and only a few examples have been 
reported. Furthermore the isolation of genes from plant cells 
is a very complex task when compared to the isolation of 
bacterial genes. ; 

As mentioned earlier, most amino acid syntheses take 
place inside the chloroplast. 

Thus, there is a great interest for the development of a 
technique for transforming plant cells with bacterial genes 
encoding an enzyme insensitive to inhibition by the above- 
said end product in a way such that the result of this 
transformation process would be the introduction of said 
enzyme in the plant chloroplasts. The ultimate result of this 
process would be an over-production of amino acid. 

These are but a few examples (additional examples will 
be mentioned later) of the prospects of considerable devel- 
opment of plant genetic engineering which will be at hand 
for the specialists as soon as practical techniques suitable for 
the introduction of determined foreign polypeptides or pro- 
teins in chloroplasts become available. 

Indeed, many techniques have been proposed for the 
transfer of DNA to plants such as direct DNA uptake, 
snicro-injection of pure DNA and the use of viral or plasmid 
vectors. Plasmid vectors which have proven particularly 
efficient are those derived from tumor-inducing (Ti) plas- 
mids of the microorganism Agrobacterium tumefactens 
which is the agent of crown gall disease in dicotyledonous 
plants. Those plasmids can be modified by removal of the 
tumor-causing genes from their T-DNA. The so modified 
plasmids then no longer interfere with normal plant growth 
and differentiation and. upon insertion of a determined 
foreign gene at an appropriate site of said plasmids, can be 
used for promoting the expression of the protein encoded by 
said determined gene in plant cells. Particularly. the foreign 
gene may be inserted close to one of the border sequeaces 
or between the two border sequences which surround the 
T-DNA in intact Ti-plasmids. Reference can be made by way 
of examples to the articles of: 


A. CAPLAN et al. titled “Introduction of Genetic Mate- 
rial into Plant Cells". Science, 18 Nov. 1983. volume 
222. pp. 815-821; 
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L. HERRERA-ESTRELLA et al titled “Expression of 
chimaeric genes transferred into plant genes using a 
Ti-plasmid-derived vectors”. Nature, vol. 303, No. 
5914, pp. 209-213, 19 May 1983; 

L. HERRERA-ESTRELLA et al. titled “Light-inducible 
and chloroplast associated expression of a chimaeric 
gene introduced into Nicotiana tabacum using a 
Ti-plasmid vector”. Nature, vol. 310, 275973. pp. 
115-120, 12 Jul 1984; 

or to European Patent Application N°0 116 718 (or to US. 
Application n°570.646) or to the International Appli- 


cation WO 84/02913 published under the PCT; all of | 


these articles or patent applications being incorporated 
herein by reference. 

Yet all of these techniques do not provide an efficient and 
zelatively easy method of transformation of chicroplasts, 
despite the considerable work which has been devoted to the 
subject and the already large amount of knowledge which 
has been acquired, particularly concerning the production of 
proteins in plant cells and transfer into the chloroplasts. As 
4 mnatter of fact yectors for the direct transformation of 
chloroplasts are unavailable at this time, Furthermore 
rnature proteins, including those normally encoded in the 
plant cells including said chloroplasts and which can ulti- 

- mately be isolated naturally from said chloroplasts, cannot 
as such be caused to penetrate in the chloroplasts if supplied 
thereto from outside. 

Most chloroplast proteins are coded for in the nuclear 
DNA and are the products of protein synthesis on cytoplas- 
mic ribosormes. many as soluble higher molecular weight 
precursors*~”. These precursors are then translocated 
through either one or both of the plastid envelope 
membranes. processed, and assembled into their final 
organellar compartment or holoenzyme complex. In vitro 
reconstitution experiments using isolated chloroplasts. have 
demonstrated that the uptake and processing of over one 
hundred nuclear-encoded. cyteplasmically synthesized pre- 
cursors by chloroplasts occurs by an energy-dependent’’, 
post-translational mechanism®, *O-!7 

The most extensively characterized of these nuciear- 
encoded chloroplast proteins is the small subunit of nbulose- 
1.5-bisphosphate (RUBP) carboxylase. This polypeptide is 
synthesized on free cytoplasmic nbosomes as a precursor of 
70.000 daltons containing an amino terminal extension or 
transit peptide of approximately 56.000 daltons oe 
During or immediately after impest of the precursor into the 
chloroplast. the transit peptide is proteolytically remoyed in 
two steps by a soluble protease’*, yielding a mature small 
subunit polypeptide of 15.000 daltons. This polypeptide is 
then assembled with an endogenous large subunit into the 
functional RUBP carboxylase holoenzyme** **. 

Similar observations were made with the chlorophyll a/b 
binding proteins. These polypeptides are synthesized as 
soluble precursors on cytoplasmid mbosomes {Apel and 
Kloppstech, 1978; Schmidt et al.. 1981) and are post- 

translationally translocated into chloroplasts. During or after 
translocation the NH.-terminal transit peptides are pro- 
teolytically cleaved (Schmidt et al, 1981) to yield the 
mature polypeptides. The mature A and B polypepides 
associated with chicrophyll a and b are integrated into the 
thylacoid membrane. The transit peptides of post transla- 
tionally transported chloroplast proteins are characterized by 
a preponderance of basic amino acids. a feature which has 
been proposed as important in the interaction of the transit 
peptide with the chloroplast envelope”. Comparison of 
transit peptides of small subunit precursors from yarious 
plant species show a variation in amino acid sequence, but 
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a relatively strong conservation in the position of prolines 
and charged amino acid tesidues*”~ and a substantial 
homology in a region surrounding the cleavage site of the 
precursors, as observed in soybean (Berry-Lowe et al., 
1982) pea (Cashmore, 1983), duck weed (Stickema ct al., 
1983) and wheat (Broglie et al; 1983). These common 
properties may be of functional significance since both in 
vitro!!? and in vivo”, the small subunit precursors from 
one plant species can be imported and comectly processed 
by the chloroplasts of others and vice-versa. 

‘The molecular basis of how the post-translational trans- 


location of polypeptides into chloroplasts occurs and which 


signals are involved in this process. more particularly the 
relative contributions of the transit peptide and the mature 
protein to the uptake and processing mechanism are still not 
fully understood, even though it was already presumed that 
the transit peptide is required for the translocation of the 
mature protein. Consistent with this is the observation that 
the mature small subunit protein is not translocated into 
chloroplasts**, 

The invention stems from several discoveries which have 
been made by Applicants. as a result of further studies of the 
translocation mechanisms through the chloroplast mem- 
branes of chioroplast-protein precursors encoded by the 
nuclear DNA of plant cells. It seems that no cytoplasmic 
factor is required for the translocation mechanism itself as a 
result of further studies carried out on RuBP. | 

Further it has been found that all the sequence information 
required for translocation and transport of the mature protein 
or of a subunit thereof through the chloroplast membranes 
seems to reside within the precursor subunits and even 
within the sole transit peptides normally associated there- 
with. It further appeared that transit peptides not only 
mediate translocation, but also include infermation neces- 
sary for site-specific processing of the corresponding pro- 
teins. 

These different properties of the transit pepddes are at the 
basis of the recombinant DNAs, more particularly recom- 
binant vectors including a DNA sequence coding for a 
determined protein or polypeptide, particularly a forcign 
protein, sought to be introduced and processed in 
chloroplasts, as well as the processes for the introduction of 
such foreign polypeptide or protein into the chloroplasts. for 
instance in the tylaccid membranes or, preferably, in the 
stroma thercof. As a matter of fact an essential element of 
these recombinant vectars consists of a DNA sequence 
coding for a transit peptide. it being understood that this 
expression, as used throughout this specification and claims 
attached thereto, designates the amino acid sequence con- 
tained in any chloroplast protein precursor which, upon 
import of the precursor, is proteolytically removed during or 
immediately after import of the precursor into the chloro- 
plast to yield either the corresponding functional mature 
protein or a subunit thereof, particularly when. like in the 
case of RuBP, the final processing of the mature protein 
takes place within the chloroplast. Such final processing 
comprises, for instance, the assembling of said subunit with 
another endogenous subunit to yield the final functional 
protein. 


SUMMARY OF THE INVENTION 


The recombinant DNA according to the invention which 
can be introduced into plant cells is characterized by the 
presence therein of a chimaeric gene comprising a first 
nucleic acid aud a second aucleic acid recombined with each 
other. said first nucleic acid and said second nucleic acid 
being of different origins. particularly being forcign to each 
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other. wherein said first nucleic acid contains a first coding 
sequence which has essential sequence homology with a 
natural gene sequence coding for a transit peptide belonging 
to a precursor comprising at least the N-terminal sub-unit of 
a chloroplast protein capable of being transported into or 
processed within the chloroplast of said plant cells and 
wherein said second nucleic acid contains a second coding 
sequence distinct of the gene sequence coding for said 
chloroplast protein or chloroplast protein sub-unit, said 
second nucleic acid being located downstream of said first 
nucleic acid in the direction of the transcription of their first 
and second sequences respectively. 


BRIEF DESCRIPTION OF THE DRAWINGS 


FIG. LA diagrammatically represents the successive steps 
of the construction of preferred recombinant DNAs, includ- 
ing a vector suitable for the transformation of plant cells, 
containing a chimaeric gene according to a first preferred 
embodiment of this invention; 

FIG. 1B represents the structure of the characteristic 
portion of the-chimaeric gene according to this invention and 
included in the abovesaid recombinant DNAs; 

FIG. 2 shows the results obtained in Southern hybridiza- 
tion experiments with recombinanst DNAs accarding to the 
invention. in relation to the detection of the incczporation of 
the abovesaid chimaeric gene in the genome of plant cells; 

FIG. 3 is a schematic representation of the organization of 
the gene fusion and the plant-vector sequences of the vector 
of FIG. LA, 2s modified by said chimaeric gene; 


6 


pGV3851pGSSTneo3-iransformed tobacco cells to pro- 
tease treatment as described in Example WH; 

FIG. 12 is an autoradiogram showing the localization of 
NPTOD activity in the stromal fraction of chloroplasts 
isolated from pGV3851:pGSSTneo3-transformed tobacco 
tissne as described in Example I; 

FIG. 13 is an autoradicgram showing the in vitro uptake 
of TP-NPT(D fusion protein by isolated pea chloroplasts as 
described in Example 1; 

FIG. 14 shows the construction of a plasmid containing a 
chimeric gene encoding the TP-NPT(ID fusion protein and 
wherein the coding sequences are under control of a foreign 
promoter. 


DESCRIPTION OF THE PREFERRED 
EMBODIMENTS 


In joint efforts of the inventers to solve the problem 
sought, i.e. providing methods and means for transporting a 
protein into chloroplasts, the inventors have devised two 
main approaches which have in common the use of recom- 
binant DNAs all including a sequence coding for a transit 
peptide. These two approaches resulted in the two preferred 
embodiments which will be exemplified hereafter and which 
both proved to be effective. | 

Example 1 is illustrative of the first approach which tock 
into account the possibility that a larger part of the nuclear 
genes, including the entire.regioa of high homology around 
the cleavage site of the precursors would be a necessary 
requirement for transport and processing of proteins, par- 


30 
FIG. 4 shows the results obtained in RNA-hybridization ticularty foreign proteins, into chloroplasts. This resulted in 
experiments carried out in relation to the detection of the a first series of preferred recombinant DNAs of this inven- 
transcriptional activity of the promoter included in the tion more particularly characterized in that the first nucleic 
chimacric gene of the invention; acid as defined hereabove contains a third sequence corre- 
FIG. SA and 5B show comparative results of transcription 35 Sponding to at least part of a nucleic acid encoding the 
experiments under the control of a light-dependent promoter N-terminal cytoplasmic subunit of a chlor oplast protem 
in plant materials transformed by the recombinant DNAs of downstream 0: said first sequence and in that the extremity 
the invention; | of said third sequence 1s substantially contiguous to the 
FIG. 6A is representative of the results obtained in experi- extremity of said first nucleic acid. 
ments purporting to demonstrate the transport of the prod- 40 Preferably the third sequence does not extend beyond the 
ucts encoded by the above said chimaeric gene into the nucleotides encoding the N-terminal extremity of the cyto- 
chloroplasts of plant cells; plasmic subunit of ire peeelore ae “be it oe 
FIG. 6B is a graphic display of the relative mobility of the an intron, particularly that which initially belonged to the 
Gjiferent activities of the gene-product detected in FIG. 6A: same gene as the exons encoding the peptidic portions which 
ae a5 will ultimately provide the precursor subunit of the corre- 
FIG. 7 iustrates results obtained in assays (to show the sponding natural chloroplast protein 
. m a t = o Ces ind — 7 ag Pd dl . Zz - 
ap repens ession of the fusion protein encoded by ‘Tt will be seen that a preferred embodiment of a recom- 
ai ee ee _ binant DNA cosresponding to the abovesaid first approach 
FIG. 8A dag ‘os u a oi iho eee fetes inchides a first sequence encoding the transit peptide and a 
the construction OF preferre® TES) ment AS according .. third sequence which initially belonged to the same gene and 
10 a second prefered embodiment of the invention, as well which encodes the first 22 amino acids of the small subunit 
as of other recombinant DNAs for study purposes; gene (rbcS) from Pisium sativum (Cashmore, 1983), said 
FIG. 8B represents the amino acid sequences encoded by third gene being then fused to the coding region of a foreign 
x porn oft lace pan Gagne thes tp, ich ot in 20 Eo oes Tas 
: ~ 55 phosphotransicrase np gene 0 om a Ind 
coding for the selected transit peptide of a gene encoding the hn: 
amino ipeorie neem eee gis eaaiaeeh Example Il is illustrative of the construction which can be 
ferase eas en ‘ 7 = pore a chee made upon taking the second approach to solve the same 
a si pees oblems. In this construction, the aptIl coding sequence is 
FIG. 9 shows the results of Southern hybridization analy- gq fied directly to the transit peptide coding sequence such 
sis of Agrobacterium and plant DNA as described in that the potential protein cleavage site does not contain any 
Example I. amine acids derived from the mature small subunit protein 
FIG. 10 is an autoradiogram showing the localization of except the jnethionine following the last amino acid of the 
NPT(ID activity in chloroplasts of tobacco callus tissue as transit peptide. More generally, and preferably, the first 
described in Example If; és codon of the second sequence (encoding the protein. pat- 


FIG. 10 is an autoradiogram showing the protection of the 
NPT{II) activity present within chloroplasts of 


ticularly a foreign protein, which is sought to be translocated 
in the chloroplasts is immediately adjacent to the last codon 
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of said first DNA sequence coding for said transit peptide. 
Thus, particularly when the abovesaid second sequence 
encodes a polypeptide or protein different from the chioro- 
plast protein normally associated with the transit peptide 
encoded by said first sequence, the nucleotide sequence neat 
to said first sequence in said chimaeric gene is generally 
(possibly except for a first codon coding for methionine) be 
free of sequence homology with the nucleotide sequence 
encoding the N-terminal part of the normal chloroplast 
protein. Yet. the last codon of the first sequence and the first 
codon of the second sequence may be separated by any 
number of nucleotide triplets, preferably in the absence of 
any intron or stop codon. For instance, a “third sequence” 
encoding the first amino acids of the mature protein nor- 
mally: associated with the transit peptide concerned in the 
corresponding natural precursor (particularly those encoded 
by the exon containing the first sequence encoding said 
transit peptide) may be present between said first and second 
sequences. This “third sequence” may consist of the region 
of high homology which the N-terminal parts of cytoplasmic 
precurscr-subunits of chloroplast proteins form soybean, 
pea, duck-weed and wheat have in common. For instance 
such “third sequence” (be it in the constructions resulting 
from the “first approach” or those from the “second 
r ” considered hereabove) encodes the pentapeptide 
sequence M-Q-V-W-P. These letters correspond to the stan- 
dard one-letter-abtreviated designations of the natural 
amino acids. Obviously other “third sequences” of 
nucleotide-sequences can be contemplated upstream or/and 
also downstream of the above-defined second sequence to 
the extent where the amino acid sequences encoded are not 
likely to alter significantly the biological properties of the 
hybrid protein thea formed and translocated into the chlo- 
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8 
ensuring the translocation into the chloroplasts or any pro- 
tein or protein fragment of controlled amino acid sequence, 
for instance a bacterial protein or protein fragment or a 
syathetic polypeptide free of hybridization with any detcr- 
mined peptidic sequence also possessed by a chloroplast 
protein or precursor. 


Tt must be understood that in the preceding definitions 
“transit peptide” has the broad meaning indicated here- 
above. The transit peptide may further be selected depending 
upon the plant species which is to be transformed, although, 
as mentioned earlier, wansit peptides or smaller sub-unit 
precursers containing such transit peptides are often not 
plant-specific. Sub-unit precursor from one plant species can 
often be imported and directly processed by the chloroplasts 
of another. . 


Preferred DNA sequences encoding a transit peptide for 
use in the recombinant DNAs of this invention comespond 
to any of those encoding a transit peptide associated with the 
small sub-unit of RuBP of pea cells, or also of wheat or 
soybean cells. 


Preferred “first nucleotide sequences” coding for transit 
peptides are defined hereafter, merely by way of examople. it 
must be understood that the letters above the lines refering 
to the nucleotide sequence per se designate the successive. 
amino acids encoded by the successive triplets of the nucle- 
otide sequence. The letters below said line comespond to 
designations of nucleotides which can be substituted for 
those designated immediately above them in the nucleotide 
sequence: a 
A 


¥ er iT j%Y¥ $5 


ATG GCT TCT ATG ATA TCC TCT TCC GCT GTG ACA ACA GTC ACC 
Q@ 8s . 


CGT GCC TCT AGG GGG CAA TCC GCC GCA GTG CCT CCA TEC CoC 


R A S R G A A VW A P #B 


G . 


a 2 G 


G LEK s M tT G F P ¥ K K ¥ HN 


GOC CTC AAA TCC ATG ACT GGA TTC CCA GIG AAG AAG CTC AAC 
es 


> wo & tT $§ Tf T s N G@ G@ R VY S&B 


ACT GAC ATT ACT TCC ATT ACA AGC AAT GGT GGA AGA STA AAG 


Cc 


TGC 


roplasts. Yet in most preferred constructions according to 35 


that type the second sequence is preferably fused in phase- 
register with the first sequence directly contiguous thereto as 
mentioned earlier or separated therefrom by no more than a 
shert peptide sequence, such as that encoded by a synthetic 
nucleotide-linker possibly used for achieving the fusion. As 
sbown by Example If such a construction is then capable of 


Al45 


x 
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Of course DNA sequences coding for other transit pep- 
tides can also be used for the construction of the chimaeric 
gene of this invention. For instance “first sequences” within 
the meaning of this application may consist of a sequence 
encoding the transit peptide of the light harvesting chloro- 
phyll af-protein complex. normally located in thylakoid 
membranes. such as: 
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ATG GCC GCA TCA TCA TCATCA TCC ATG GCT CTC TCT TCT CCA 


rT L A G EF QL K LN P S §S 


Q 


ACC TTG GCT GGC AAG CAA CTC AAG CTG AAC CCA TCA AGC CAA 


= 2 G A A R P T 


GAA TTG GGA GCT GCA AGG TTC ACC 


The DNA sequence coding for the twansit peptide is 
advantageously the natural nuclear DNA gene portion or a 
cDNA obtained from the corresponding mRNA. 

Needless to say that any other DNA sequence encoding 
similar amino acid sequences can bé substinited therefor. It 
may for instance be contemplated to use a synthetically 
produced DNA sequence in which some of the codons differ 
from corresponding codons in the natural DNA sequence, 
while nevertheless coding for the same corresponding ami- 
noacids. In that respect the expression “transit peptide” 
should also be understood as extending to any peptide which 
could differ from a natural transit peptide at the level of 
some of the aminoacids, to the extent where the substitutions 
contermplated would not alter the operability of the resulting 
peptide to promote the translocation into the chloroplast of 
the foreign polypeptide or protein encoded by the DNA 
sequence associated with or adjacent to the sequence encod- 
ing such peptide. Thus, the chimaeric genes of the invention 
may be constructed with any “first sequence” having sub- 
stantial sequence homology with a natural DNA sequence 
encoding a natural transit peptide. 

Conceming the protein or polypeptide encoded by the 
abovesaid “second sequence”, it should also be understood 
that it may consist of any protein or polypeptide sought to be 
introduced into or processed within the chloroplasts of 
determined plants. Therefore the DNA sequence encoding it 
is usually forcign to or heterologous with respect to the DNA. 
sequence encoding the polypeptide or protein normally 
associated with the chosen transit peptide. In other words, 
the first and second DNA sequences usually originate from 
different sources. Particularly the second sequence encodes 
a foreign protein or polypeptide. for instance. of bacterial 
origin. But, the invention also extends to proteins that are 
naturally endogenous to chioroplasts of planis other than ine 

“determined plant” considered hereabove or even to chio- 
roplast proteins corresponding to natural chioroplast pro- 
teins of the same plant, yet differing therefrom only by a few 
amino acids (“mutated” protein). Techniques for directing 
such mutation (whether in the first or the second sequences) 
are exemplified in a recent paper of S. Gutteridge et al. titled 
“A site specific mutation within the active site of ribulose- 
1.5-bisphosphate carboxylase of “Rhodospirillum rubrum” 
(1984). 

Furthermore.the chimaeric gene of a preferred recombi- 
- nant DNA accerding to the invention comprises a prometer 
region upstream of the above mentioned fused sequences, in 
such manner that, when said chimaeric gene is inserted 
within an appropriate vector. the transcription of both the 
abovesaid first and second sequences are under the control 
of said promoter. The promoter region conteroplated here- 
above should of course be selected among those which are 
recognized by the polymerases endogenous to the plant 
sought to be transformed. Of particular advantage are the 
promoters effective in pea. wheat, soybean or tebacco cells. 
The promoter may be that normally associated with the 
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sequence encoding the chosen transit peptide. It may how- 
ever also be different An example of construction using 
another promoter is illustrated later in the examples. For 
instance, suitable promoters are those belonging to the genes 
of plasto-cyanin, ferredoxin-NADP” oxydoreductase, etc. 
Other suitable promoters are exemplified in the literahure 
referred to in this application. 

Preferably the sequence coding for the transit peptide is 
under the direct control of the selected promoter. This means 
that the first nucleotide triplet transcribed and expressed 
under the control of said promoter is preferably that of the 
sequence encoding the transit peptide. This of course is not 
critical, for instance, as evidenced by the first example. 

Finally the invention also relates to recombinant yectcrs, 
particularly plasmids which can be introduced and main- 
tained in plant cells and containing the abovesaid chimaeric 
gene, including the above-defined promoter region. 

Preferred vectors of this type are those derived from the 
Ti-plasmids referred to hereabove. More particularly. a 
preferred vector of this type comprises in addition to said 
Chimaeric gene a DNA fragment suitably positioned with 
respect to said foreign gene and having essential sequence 
homology with the DNA of a Ti plasmid including a T-DNA 
fragment. the sequences encoding the essential functions 
capable of causing the transfer of said T-DNA fragment and 
said chimeric gene into said plant cells, Particularly. a 
preferred vector according to the invention contains a 
T-DNA border sequence and said chimaeric gene is posi- 
tioned close thereto. Even more preferred vectors of this 
type comprise two border sequences. the chimaeric gene 
then being positioned between these two border sequences. 
Concerning general methods for inserting the chimaeric 
gene in Ti-plasmids, reference is made to the patents referred 
to above by way of examples. . | 

Advantageously the recombinant DNA (be it the chimac- 
ric gene as such of the vector which contains: it) should 
preferably include the appropriate site for the initiation of 
the corresponding RNA transctiption upstream of the first 
codon to be translated, in most cases an ATG codon. It is also 
of advantage that the recombinant DNA comprises down- 
stream of the foreign gene to be expressed appropriate 
transcription termination and polyadenylation signals. 

The invention also concerns a process for achieving and 
controlling the introduction of a determined protein or 
polypeptide (or fragment of said protein or polypeptide) into 
the chicroplasts of determined plant cells. Any suitable 
process for performing this introduction can be resorted to. 
Advantageously use is made of vectors of the type exem- 
plified and modified by a chimaeric gene according to the 
inyention and comprising a “second coding sequence” 
encoding a protein or polypeptide. But any other process can 
be resorted to. For instance a chimaeric gene according to 
the invention can be inserted in plant cells simply by the 
calcium chiccide polycthylenglycol precipitation methods or 
also by microinjection into the plant cells. 
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Additional features of the inyention will appear in the 
course of the following disclosure of the conditions under 
which the structural requirements of vectors capable of 
transforming plant cells for the sake of ultimately causing a 
determined foreign gene product to be inserted in chloro- 
plasts were determined. 

In the examples which follow the approach taken has been 
to construct a chimaeric gene encoding a fusion protein 
containing the transit peptide of the precursor to the small 
sub-unit of RuBP carboxylase from pea and a coding 

_ sequence of the bacterial neomycin phosphotransferase (ID) 
{abbrevited as NPT). 

The NPT(D protein was chosen because NPTQD protein 
confers resistance for kanamycin to plants (HERRERA- 
ESTRELLA et al, 1983; FRALEY et al., 1983; BEVAN et 


al.. 1983), fusion proteins are biologically active (Reiss etal, 


1984b) and an enzymatic assay for in situ detection of 
NPT(ID or NPT fusion proteins in non-denaturing poly- 
acrylamide gels has been described recently*’. This method 
is particularly useful to distinguish processed from unproec- 
essed forms of the fusion protein. 


12 


stituted by pBR322 (ZAMBRYSKI et al. 1983; 1984). The 
T-DNA of pGV3851 still contains the genes coding for 
transcripts 4, 6a and 6b (WILLMITZER et al, 1983) which 
results in a teratoma-like growth of the transformed tissue 
(JOOS et al, 1983), whereas all tumor controlling genes 
have been eliminated in pGV3850 with the result that plant 
cells transformed with this vector can differentiate and grow 
as normal plants (ZAMBRYSKI et al., 1983; DE BLOCK et 
al, 1984). The gene constructions were introduced into 
pGV3850 and pGV3851 Ti-plasmids by homologous 
recombination after mobilization from E. coli to Agrobac- 
tecium with the help of plasmids R64drd ll and J G28 (VAN 
HAUTE et al, 1983). 

Cointegrates were selected on spectinomycin and strep- 
tomycin containing plates and their structure verified by 
Southern biot hybridization (SOUTHERN. 1975) using vari- 
ous parts of the constructions as probes {data not shown). 


Plant Transformation 


The chimacric genes were introduced into Nicofiana 
tabacum cy. Wisconsin 38 or SR1 by inoculation of a 
wounded plantlet or by co-cultivation of protoplasts with 


es Agrobacterium. Transformed material obtained by wound- 
GENERAL OUTLINE z5 ing was screened for the presence of nopaline synthase 


Construction of Plasruids pSNIP and pSNIP 
Containing the Chimaeric Gene (tp-ss-nptlt) 


A genomic clone for one of the rbcS genes from pea was 
isolated, sequenced and made available by Dr. A. 
CASHMORE, Rockefeller University, New York (pPSR6). 
From this clone the promoter signals, (CASHMORE, 1983; 
HERRERA-ESTRELLA et al., 1984), the first exon coding 
for the rbcS transit peptide and the first two codons of the 
mature small subunit protein. followed by the first intron (83 
pb) and pat of the second exon (66 bp) coding for the amino 
terminus of the mature small subunit protein were used via 
a Sau3A restriction endonuclease recognition site with the 
BamHI site of the plasmid pKm109/9 (REISS et al.. 1984b) 
which contains the coding region for the nptll gene from 
Tn5 (BECK et al, 1982). 

The fusion gene which was obtained and which contained 
the transit sequence (S6 codons) and 22 codons from the 
mature rbcS gene linked via seven artificial codons with the 
second codon from the npill gene (FIG. 18) were found to 
be similarily active. The size of the coding region of the 
nptil gene is 1130 bp. The fusion junction was verified (data 
not shown) by DNA sequencing (MAXAM and GILBERT, 
1977). The chimaeric protein should have a size of Mr 
38.023 in the unprocessed and of Mr 32.298 in the processed 
form. Southern type (SOUTHERN, 1975) hybridization data 
(FIG. 2) established that transformed plant tissues contained 
the chimaeric gene constructs in the nuclear DNA and that 
no detectable DNA rearrangements had occured during 
integration. A schematic represeniation of the results is 
given in FIG. 3. 
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A more detailed disclosure of the construction will be 
given hereafter, more particularly in relation to FIG. 1A, 18 
and 3. 


Production of Vectors Capable of Transforming 
Planis 


To introduce the chimeric genes in the nuclear genome 
of plants. the plasmid was inserted into the T-DNA of 
pGV3851 and of pGV3850. both derivatives of the Ti 
plasmid pTIC58. in which parts of the T-DNA where sub- 


65 


activity (OTTEN, 1982), a cotransferred marker. Transfor- 
mants (pGV3851::DSNIP) grew on 250 ug/ml kanamycin as 
green teratoma tissue, suggesting that a functional chimacric 
gene was present and transcribed. In co-cultivation experi- 
ments N. tabacum SR1 protoplasts were incubated with 
Agrobacterium containing (pGV3850::SNIF) and selected 
after two weeks with 100 ug/ml kanamycin. From 9 indi- 
vidual colonies which were positive when tested for NPTH 
activity. one was chosen and regenerated undet constant 
selected pressure to a fully normal icoking plant. Genetic 
analysis shows inheritance of the NPTI marker in a clas- 
sical mendelian fashion. These results suggested that tran- 
scripts from the chimaeric genes were properly processed, 
transported out of the nucleus and translated into a func- 
tionally active protcin. 


Light Induction of the Chimaeric Gene 


Poly(A}+ and poly(A}-RNA from wild type and ftom 
transformed tissues (pGV3851::pSNIP) was isolated and 
analysed by so called “Northern” gel hybridizations. When 
the coding region of the nptli gene (BamHI- Smal fragment 
from pKM109/9) was used as a probe a complex Aybric- 
ization pattern was observed with RNAs ranging between 
5.500 nuclectides and 8,000 nucleotides in size. These 
RNAs were detected in light grown teratomas only. Four 
days of darkness after a day-night rhythm of twelve hours 
resulted in a marked decrease of the signals (FIG. 4). The 
very large size of these transcripts probably results from the 
fact that no proper polyadenylation and transcription texmi- 
pation site was introduced near the translation termination 
signal. No signals of compatable size or strength were 
observed in wild type Wisconsin 38 tobacco or in material 
obtained from a plant transformed with the pGV3850 vector 
only (FIG. 4). In order to compare the light dependent 
transcription of the chimacric gene with that of both the 
endogenous rbcS gene and the chlorcplast gene coding for 
the large subunit of Rubisco (rbcL). poly(A} and poly(A) 
~RNA from light- and dark-grown teratoma were hybridized 
to specific probes for each of these genes. The results are 
illustrated in FIG. 5A. Signals of the endogenous rbcS 
transcripts (850 nucleotides) were observed at the expected 
position. Similarly, a transcript of 1750 nucleotides was 


